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The closely related transcription factors (TFs), estrogen receptors ERa and ERp, regulate divergent 
gene expression programs and proliferative outcomes in breast cancer. Utilizing breast cancer cells 
with ERa, ERp, or both receptors as a model system to define the basis for differing response 
specification by related TFs, we show that these TFs and their key coregulators, SRC3 and RIP 140, 
generate overlapping as well as unique chromatin-binding and transcription-regulating modules. 
Cistrome and transcriptome analyses and the use of clustering algorithms delineated 11 clusters 
representing different chromatin-bound receptor and coregulator assemblies that could be 
functionally associated through enrichment analysis with distinct patterns of gene regulation and 
preferential coregulator usage, RIP140 with ERp and SRC3 with ERa. The receptors modified each 
other's transcriptional effect, and ER(3 countered the proliferative drive of ERa through several novel 
mechanisms associated with specific binding-site clusters. Our findings delineate distinct 
TF-coregulator assemblies that function as control nodes, specifying precise patterns of gene 
regulation, proliferation, and metabolism, as exemplified by two of the most important nuclear 
hormone receptors in human breast cancer. 
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Introduction 

Members of the superfamily of nuclear hormone receptors 
(NHRs) have pivotal roles in almost all aspects of physiology, 
including reproduction, metabohsm, and immune function 
(Bookout et aU 2006). The estrogen receptor-a (ERa) and -p 
(ERP) belong to this superfamily of transcription factors (TPs) 
and mediate the diverse actions of estrogens, including their 
involvement in several disease states, such as breast cancer 
(Katzenellenbogen and Katzenellenbogen, 2000; Nilsson et aZ, 
2001; Deroo and Korach, 2006} . Although encoded by different 
genes, ERa and ERp share the same general modular protein 
structure common to the NHR superfamily, and have nearly 
100% amino acid sequence homology in their DNA-binding 
domain. As such, it has been previously shown that ERa and 
ERp are able to bind to similar DNA elements. They also show 
59% amino acid homology in their Hgand-binding domains, 
and can heterodimerize as well as act as homodimers (Cowley 
et aU 1997; Pace et aU 1997). 

It has been well documented that ERa controls the 
transcription of hundreds of genes (Prasor et aU 2003; 
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Hah et aU 2011), and that its presence in breast cancer cells 
drives enhanced prohferation in response to estrogens (Prasor 
et aU 2003; Chang et aU 2008). Therefore, ERa is the main 
therapeutic target in ER-positive breast tumors, which 
represent two-thirds of all breast cancers. The precise role of 
ERP and the manner in which it can impact estrogen actions in 
breast cancer, however, are less clear. ERP is expressed in ca. 
70 % of human breast tumors, often along with ERa, with some 
human breast tumors expressing only ERP (Kurebayashi et aZ, 
2000; Speirs et aU 2004; Saji et aU 2005; Skliris et aU 2006). 
Although several reports have implicated ERp as having net 
antiprohferative effects in breast cancer cells (Lazennec et aU 
2001; Paruthiyil et al, 2004; Strom et al, 2004; Chang et aU 
2006; Lin et al 2007 sl; Williams et al 2008), elucidation of the 
mechanistic basis for the seemingly contrasting actions of 
ERa and ERP in breast cancer cells, including delineating the 
manner in which the genes involved are differentially selected 
for regulation by ERa and ERp, and mapping of the signaling 
pathways utilized, remain critical issues. 

When ERa and ERP bind their ligand, 17p-estradiol (E2), 
they undergo conformational changes that release heat shock 
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proteins, enhancing receptor dimerization, interactions with 
coregulators (Skliris et aZ, 2006; Xu et aU 2009), and binding to 
the regulatory regions of target genes. ERs can be targeted to 
chromatin by direct recognition of estrogen response elements 
(EREs) through the agency of pioneer factors (e.g., FOXAl, 
GATA3, and PBXl) that modify the chromatin environment to 
a more permissive state, or via tethering to other TPs (e.g., Spl 
and API; Ah and Coombes, 2000; Glass and Rosenfeld, 2000; 
McKenna and O'Malley, 2002; Fullwood et al, 2009; Stender 
et al 2010; Rosell et al, 2011; Jozwik and Carroll, 2012). 

Given the fact that both ERs can potentially recognize 
similar chromatin-binding sites, interact with a largely 
overlapping set of coregulators, and form both homo- and 
heterodimers in order to regulate gene expression and cell 
phenotypic properties, we explored how estradiol can ehcit 
contrasting phenotypic outcomes— proprohferative versus 
antiproHferative — through these two closely related TPs. 
In this report, we have undertaken an integrative 
genomic approach to map in a comprehensive manner the 
chromatin-binding interactions of ERa and EPp, and their key 
coregulators, SRC3 and RIP140 (Cavailles etal 1995; Glass and 
Rosenfeld, 2000; Xu et al 2000; Rosell et al 2011), in the same 
cell background when the receptors are present alone or 
together. The use of novel clustering algorithms enabled us to 
associate the distinct chromatin-binding landscapes of these 
receptor and coregulator modules with ER-regulated gene sets 
that delineate the specific cellular pathways and regulatory 
programs underlying the distinct phenotypic outcomes 
induced by hormone working through these two important 
NHRs in breast cancer cells. 

These integrative and clustering approaches, delineating 
distinct genome-wide patterns of chromatin binding of 
receptors and coregulators with gene expression behavior 
and functional outcomes, can be apphed broadly to elucidate 
the molecular underpinnings for the transcriptional regulation 
and physiological effects of any TP in response to extrinsic or 
temporally modulated stimuh. 

Results 

Genome-wide analysis of ERa, ERp, SRC3 and 
RIP140 chromatin binding by ChlP-seq 

Although ERa and EPP have high structural and sequence 
homology, especially in their DNA-binding domains, it is not 
known whether these closely related receptors, in the same 
cell background, would substitute for one another when 
present alone, whether they would synergize or antagonize 
each other at different regulatory gene sites when present 
together, and how their utilization of coregulators might 
contribute to their specification of activities at the many gene 
regulatory sites to which these ERs bind. To compare genome- 
wide cartographies of ERa and EPp, and their modulation of 
gene expression in these contexts, we utilized MCP-7 breast 
cancer cells that endogenously express only ERa, or cells 
expressing only EPp (adenovirally expressed EPp with knock- 
down of ERa via RNAi), or both ERa and EPp at similar levels 
(termed EPa/EPp cells). The presence of ERa and/or EPp 
protein in the three cell types used for genome-wide analyses 
was confirmed by western blot (Supplementary Figure 1). 
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Chip samples for ChlP-seq analysis and total RNA for cDNA 
microarray analysis under vehicle control or 10 nM estradiol 
treatments were prepared in parallel in these three cell 
contexts (see Figure lA for the experimental scheme). 

For cistrome analysis cells were treated with hormone for 
45 min, whereas for gene expression studies total RNA was 
collected after an early (4 h) and a late (24 h) time of hormone 
treatment to gain insight into both rapid and more delayed 
responses. Whole-genome binding sites profiled by the ChlP- 
Seq approach were analyzed using MACS (Model-based 
Analysis of ChlP-Seq; Zhang et al 2008), to identify binding 
sites with high confidence. Multiple rounds of ChlP-seq data 
analysis were then carried out using the integrated ChlP-seq 
data interpretation program, seqMINER (Ye et al 2011), to 
identify, by clustering algorithms, subgroups of genomic loci 
for ERa, EPp, and the coregulators SRC3 and RIP140 having 
similar compositional features. Further analyses were then 
done to relate subgroups of genomic binding sites (clusters) 
with binding sites for other TPs, and with genes regulated by 
estradiol in the three receptor cell backgrounds. 

We first compared ERa and EPp cistromes when the two 
receptors were present alone (ERa versus EPp cells) . As shown 
by the Venn diagram in Figure IB, out of ca. 38 000 high- 
confidence binding sites for each receptor in ERa or EPp cells, 
only about 40% (15 000) overlapped, indicating that despite 
homologous DNA-binding domains the two ERs bind to largely 
distinct chromatin regions. Notably, when both ERs were 
present together in cells, the number of binding sites for each 
ER was markedly reduced (to ca. 20 000), suggesting that 
under this condition a more complex chromatin interplay 
causes a mutual restriction in the total number of sites 
accessible to ERa and EPp. Approximately half (7600) of the 
DNA regions bound by both ERa and EPp in the EPa/EPp cells 
overlapped with binding sites occupied by the receptors 
in the ERa- or EPp-only cells. Moreover, we identified ca. 
2500 binding sites for ERa and ca. 3000 sites for EPp, which 
were unique to the EPa/EPp cells, further emphasizing that 
cells expressing both receptors present a different cistromic 
landscape. 

To examine whether the concurrent presence of key 
coregulators for ERs would provide better stratification 
of ER-binding sites, we performed ChlP-seq for the key 
coregulators SRC3 and RIP140, known to be involved in 
estrogen-controlled mechanisms of transcriptional regulation 
(Cavailles et al 1995; Augereau et al 2006; Lanz et al 2010; 
Rosell et al 2011; Madak-Erdogan and Katzenellenbogen, 
2012). In case of RIP140 (Figure IC), we observed >2000 
binding sites in ERa and EPa/EPp cells, with an overlap of 
ca. 1400. Interestingly, in the EPp cells the number of RIP140- 
binding sites was more than fourfold increased, to ca. 9000, 
suggestive of a major shift in the role of RIP140 when EPp is 
present alone. On the other hand, this major change with EPp 
was not seen for SRC3, where only a modest (ca. 1.5-fold) 
increase in the number of SRC3 -binding sites was observed 
in the EPp cells (Figure ID). 

We then overlaid the coregulator cistromes with the ER 
cistromes (Figures IE and F). As apparent from the Venn 
diagrams, we observed that the great majority of coregulator 
binding sites (>80%) were associated with genomic regions 
bound by ERa and/or EPp, clearly showing that ERs are the 
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Figure 1 Genome-wide analysis of ERa, ERp, SRC3 and RIP140 cliromatin binding by ChlP-seq. (A) Experimental design for ChlP-seq, gene microarray, 
and clustering analyses. Veh, vehicle; E2, 10 nM 17p-estradiol. (B) Comparison of ERa- and ERp-binding sites in the different cell backgrounds. (C) Comparison of 
RIP1 40-binding sites in the different cell backgrounds. (D) Comparison of SRC3-binding sites in the different cell backgrounds. (E) Comparison of the number of ER and 
coregulator (SRC3, RIP140)-binding sites in ERa cells or ERp cells. (F) Overlap of RIP140- or SRC3-binding sites with ERs in ERa/ERp cells. 



major hubs for chromatin locahzation of these coregulators 
after hormone treatment. 



Categorization of binding-site clusters based on 
ERa, ERp, SRC3, and RIP140 cistromes 

To parse the wealth of data obtained from ChlP-seq analyses, 
we examined whether the combinatorial usage of binding sites 
for ERs and coregulators might contain more physiologically 
and/or mechanistically relevant information. We therefore 

© 2013 EMBO and Macmillan Publishers Limited 



utilized a clustering approach using the seqMINER software 
(Ye et aU 2011), which provided a more comprehensive view of 
genome occupancy of ERs and coregulators across the 
different cell backgrounds (Figure 2A}. This software 
compared the presence of the multiple factors at a given 
chromosomal location within a 300-bp window in both 
directions and clustered together those binding sites that 
shared a similar pattern of factor localization. 

In some of the clusters, we observed a consistent set of 
ca. 100-bp steps in the location of the binding sites for the factors 
with respect to ER. As the 300-bp window in both directions 
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Figure 2 Categorization of binding sites into clusters based on ERa, ERp, SRC3, and RIP140 cistromes. (A) Clustering of the binding sites. seqMINER software was 
used for clustering based on colocalization of different factors in different cell backgrounds within a 300-bp window in both directions. (B) Conservation of binding sites 
among vertebrates: cistrome conservation tool was used to compare conservation of binding sites from different clusters among vertebrates. (C) Genomic location of 
binding sites: web-based CEAS tool was used for identifying genomic location of binding sites from each cluster. 




encompasses nearly three nucleosomes these factors are 
presumed to be recruited into the same complexes, but might 
have been crosslinked by the ChIP process to neighboring 
nucleosomes with respect to ER. These displacements are 
highly consistent in magnitude among different data sets, even 
when our data are compared with that of different factors from 
different laboratories (such as FOXAl and GATA3; 
Supplementary Figure 2) . This suggests that we are observing 
a biologically significant phenomenon that is inherent to these 
complexes and the recruitment process. 

From these analyses, we identified 11 distinct clusters 
representing groups of loci having similar receptor and 
coregulator composition (Figure 2A}. Cluster 1 represents 
ERa-binding sites observed in ERa cells that were lost upon 
introduction of ER(3 (ERa/ERp cells) and were not bound by 
ERP in ERp-only cells. Clusters 2 and 3 contained ERot- and/or 
ERp-binding sites in three (C2} or two (C3) of the cell contexts. 
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Clusters 4-7 represent ER-binding sites that were occupied 
by either receptor in all of the cell backgrounds. Clusters 8 
and 9 contained binding sites predominantly in cells 
containing ERP (ERa/ERp or ERp-only), and Clusters 10 and 
11 contained binding sites that were occupied by ERp in ERp- 
only cells. Avery interesting observation was that the presence 
of either SRC3 and/or RIP140 indicated that a binding site 
would be occupied by either ERa and/or ERP in all the 
cell backgrounds (Clusters 4 and 5), whereas the two 
coregulators were virtually absent at binding sites in Clusters 
1-3 and Clusters 10 and 11, which represent ERa and ERp 
unique sites, respectively, and these coregulators were also 
reduced in Clusters 6-9. ER bound at sites in these clusters 
might use alternative coregulators. As discussed below. 
Clusters 4-7 are associated with many of the classic 
estrogen-regulated genes, such as TFFl/pS2 (trefoil factor 1} 
and GREBL 
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Associations of clusters with F0XA1- and 
GATA3-binding sites 

With the pubhc availabihty of other ChlP-seq data sets from 
MCF-7 ERa cells, we combined our cistrome analysis with 
genome-wide data on FOXAl (GSE data set 26 831 (Joseph 
etaU 2010}} and GATA3 (GSE 29 073 (Kong etal 2011}}, which 



are reported to be important pioneering TPs for ER-binding site 
determination and epithelial lineage differentiation (Carroll 
etal 2005, 2006; Eeckhoute etal 2007; Lin etal 2007b; Lupien 
et al 2008; Hurtado et al 2011; Kong et al 2011}. By doing so, 
we were able to further stratify Cluster 1 into three smaller 
clusters (Clusters la, lb, and Ic; Pigure 2 A} that are 



Table I Clusters and their associated GO terms and enriched TF-binding motifs. 



Cluster number and 
receptor sites 

(and cells} Associated GO terms Enriched TF-binding motifs 



Cluster 1 Gene associated in ER+ tumors, downregulated in Tam-resistant tumors ER, GATA3 (Clc} 

ERoc (oc cells} Abnormal cell migration, mammary gland epithelium development FOXAl, other FOX family members 

Epithelial cell proliferation [Clb-c} 

Mammary gland development 



Cluster 2 

ERa (a,oc/p} 

ERp m 


Mammary gland development 

Abnormal lactation, mammary gland, liver, and thymus development 
Gene downregulated in tamoxifen-resistant breast cancers 


ER 


Cluster 3 


Mammary gland alveolar hyperplasia 


ER 


ERa (a/p} 


Increased cell proliferation 




ERp (p} 


Epithelial cell development 






Mammary gland epithelium development 






FOXAl TP network 





Cluster 4 Epithelial cell differentiation ER, AP2a, Zicl, VDR 

ERa (a,a/p} Mammary gland branching 

ERp (a/p,p} Genes associated with tamoxifen resistance in breast tumors 

SRC3 (a,a/p,p} 
RIP140 (a,a/p,p} 



Cluster 5 

ERa (a,a/p} 
ERP (a/p,p} 
SRC3 (a/p,P) 
RIP140 (P} 


Fatty acid metabolism 
Regulation of NF-kB activity 
Liver inflammation 
IGF-1 signaling pathway 


ER, AP2a, Snail, SPl 


Cluster 6 


PI3K binding 


ER, HNP4, API 


ERa (a,a/p} 


Epithelial cell proliferation 




ERp (a/p,p} 


Prostate gland development 






Focal adhesion 






Endometrial carcinoma 





Cluster 7 

ERa (a,a/p} 
ERP (a/p,p} 
RIP140 (P} 



Cellular response to E2 
Adipose tissue development 
Insulin receptor subunit binding 
Mammary gland epitheUa development 



ER, Nurr, ROR, COUP-TP, HNP4 



Cluster 8 

ERp (p} 


Ubiquitin conjugating enzymes 
Cell cycle and apoptosis 


ER, E2F, ETS, CREB 


Cluster 9 

ERP (a/p} 


Mammary gland alveolus development 
Mammary gland epithelial cell proliferation 
Increase cell proliferation 


ER, AP2a, STAT3, p300, PAX-4 


Cluster 10 

ERP (P} 


Osteoblast development 

Mammary gland epithelial development 

Response to corticosterone 

p38 signaling pathway 


ER, Snail, E2F 


Cluster 11 

ERP (P} 


Apoptosis signaling pathway 
Cell proliferation 

Genes decreased during metastasis 
p38 signaling 


ER, AP2a, SPl, ERR, LM02, Hairy 



Web-based GREAT software was used to determine the GO grouping of genes that map to each cluster (20 kb window). Cistrome, Seqpos, and CEAS softwares were 
utihzed to determine enriched TF-binding motifs. 
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distinguished by having (Clusters lb and Ic) or lacking 
(Cluster la) FOXAl and/or GATA3 binding, overlapping with 
ERa-binding sites (Supplementary Figure 2). FOXAl- and 
GATA3 -binding sites were also associated with Clusters 4 and 6 
but, interestingly, not with Clusters 2, 3, 5, 7, or 8, despite ERa 
binding in these clusters. We also observed no association of 
GATA3 and FOXAl with Clusters 9, 10, and 11 specific for ER(3 
DNA binding, implying that FOXAl - and GATA3 -binding sites in 
ERa cells do not predict ER(3 binding to chromatin in ERp cells. 

Enriched TF-binding motifs and binding-site 
mapping in the clusters 

To further examine these clusters, we performed TF motif 
analysis prediction for each cluster in Figure 2A, using Seqpos 
and CEAS (ds-regulatory element annotation system; Shin 
et aU 2009} softwares (http://liulab.dfci.harvard.edu/CEAS/ 
usermanual.html}. Table I shows the most highly enriched 
binding-site motifs for each cluster. As expected, the ER motif 
was the most highly enriched motif, but other motifs 
were characteristic of specific clusters (Table I, and see 
Supplementary Tables 1 and 2 for more detailed analysis of 
enriched TF motifs in the ER-binding sites in Clusters 1-11, 
based on SeqPos and CEAS analysis), suggesting a diversity of 
pathways likely to be represented after Gene Ontology (GO) 
analysis (described below). Also consistent with our cistromes 
overlay analysis, FOXAl- and GATA3 -binding motifs were 
enriched only at ERa-binding sites (Table I). Interestingly, 
motif enrichment analysis failed to show any major differences 
in the composition of TF-binding motifs or strength of EREs 
between different clusters, suggesting that the selectivity for 
binding might be due to certain epigenetic marks or other 
factors (such as FOXAl and GATA3) that render these sites 
accessible for ER binding with or without SRC3 and/or RIP 140 
in different cell backgrounds (Supplementary Figure 2) . 

Regarding the genomic location of the receptor binding sites, 
ERa and ERp were found to bind preferentially to distal 
enhancer regions, followed by intronic regions, in almost all 
clusters (Figure 2B). Notably, however, almost 50% of ER- 
binding sites in C8, 16% in C9, and 8% in CIO mapped to 
proximal promoter regions. These clusters showed enrichment 
for cell cycle-related genes, and E2F predicted binding site 
motifs (Table I) . Another notable feature of cluster 8 was its 
high sequence conservation among different species, which 
was the highest of all clusters (Figure 2C). 

Patterns of E2-regulated gene expression in 
ER(x, ERo/ERp, and ERp cells, and their association 
with ER and coregulator cistromes and cellular 
pathways 

To understand the relative importance and functional out- 
comes of each of the identified clusters of ER and coregulator 
binding sites in each cell background, we compared gene 
expression cDNA microarrays in the three (ERa, ER(3, and 
ERa/ERP) cell backgrounds at two time points (4 h and 24 h) . 
After data normalization and clustering analysis of the three 
cell transcriptomes (Figure 3 A-B) , we found that at the 4-h time 
point estradiol regulated, both up or down, a similar number of 
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genes in ERa cells (657) or ER(3 cells (674). In cells with both 
receptors, there was a greater overlap with ERa cells (~30%), 
and more importantly there was a significant number of genes 
that were uniquely regulated in ERa cells (253), in ERp cells 
(555), or in ERa/ERp cells (243). Hence, ERa and ERp, alone 
and together, regulate some common but also a large number 
of unique transcripts. 

We observed that the number of genes regulated in all cell 
backgrounds was greater at 24 h versus 4 h (1296 versus 657 in 
ERa cells, 1133 versus 658 in ERa/ERp cells, and 988 versus 
674 in ERP cells, respectively). As an alternative way of 
describing the data, we generated scatter plots of the regulated 
genes in each cell background (Figure 3C), which highlight 
two main trends in this data. One is that the direction of 
regulation by ERa (up (shown in red) or down (shown in 
blue)) does not change with the copresence of ERp or in the 
ERp-only cells (i.e., if a gene is regulated in all the cell 
backgrounds, it will always change in the same direction); 
and two, the magnitude of the E2 response varies among cell 
backgrounds, generally being highest in ERa cells, followed 
by ERa/ERp cells and then ERP cells. 

To investigate how changes in hormone-mediated gene 
expression are related to receptor and coregulator binding 
events, we mapped the closest E2-regulated gene that resides 
within a 20-kb window of an ER-binding site. We found that 
two-thirds of all of the E2-regulated genes harbored at least one 
receptor binding site in its proximity (2861/4190 genes; 
Figure 4A), and often harbored multiple sites for different 
clusters (Figure 4B and C). This allowed us to use powerful 
statistical means to evaluate the predictive power of the 
ER-binding site clusters, described in Figure 2A, in determin- 
ing the direction and magnitude of E2 -dependent gene 
regulation with different cell background and duration of 
ligand treatment (4h and 24 h; Figure 4B). For this, we 
calculated enrichment of E2-regulated genes relative to all the 
genes with a binding site in the respective cluster, and then 
performed clustering. 

From this analysis (Figure 4B) we found that, overall, 
upregulated genes in the three cell contexts at both times were 
most highly enriched in binding sites from Clusters 2, 4, and 6. 
By contrast, downregulated genes overall were enriched in 
sites from Clusters lb and Ic, and in most cases showed deficits 
in binding sites from several other clusters (e.g., C4, C3, and C7). 
Other patterns of binding-site enrichment and deficits could be 
associated with particular upregulation or downregulation of 
gene expression in different cell contexts or times. Thus, this 
clustering approach not only is able to stratify ER-binding sites 
based upon co-occupancy of these TFs and coregulators, but also 
has predictive value for the direction and time of E2-mediated 
gene regulation for any given cell background. 

To further examine the relationship between hormone- 
regulated genes and the binding sites from different clusters, 
we clustered the BED files for the genes that map to binding 
sites in each cluster with those for hormone-regulated 
genes at different times of ligand treatment in different cell 
backgrounds that contained at least one ER-binding site 
(Figure 4C). This schematically illustrates that hormone- 
regulated genes are highly likely to have more than one type 
of ER-binding site, often having representatives from several 
different clusters. 
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Figure 3 Patterns of E2-regulated gene expression in ERa, ERa/ERp, and ERp cells. (A) Comparison of number of 17p-estradiol (E2) regulated genes after 4 h of 
ligand treatment in different cell backgrounds. Genespring Venn Diagram tool was used to compare overlap between genes regulated ^ 1 .8-fold with FDR of 0.01 in 
each background. (B) Comparison of number of E2-regulated genes after 24 h of ligand treatment in the different cell backgrounds. (C) Scatter plots for gene regulations 
in the different cell backgrounds: scatter plots of E2-regulated genes in each cell background were generated using the Genespring scatterplot tool. The top three panels 
show how genes that are regulated by E2 in ERa cells are also being regulated in ERa/ERp cells and in ERp cells. The middle three panels show how genes that are 
regulated by E2 in ERa/ERp cells are also being regulated in ERa cells and in ERp cells. The bottom three panels show how genes that are regulated by E2 in ERp cells 
are also being regulated in ERa cells and in ERa/ERp cells. E2-upregulated genes are shown in red and E2-downregulated genes are shown in blue. 
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Figure 4 Association of ER-mediated gene expression witli receptor and coregulator cistromes. (A) Comparison of number of genes witli at least one binding site 
witliin 20 kb and E2-upregulated or -down regulated genes. (B) Association of gene regulations in different cell backgrounds with binding site clusters: enrichment of 
binding-site clusters (C1-C11) associated with upregulated and downregulated genes in each cell background after 4 or 24 h of estradiol treatment is shown. Level of 
enrichment is designated by the color scale. Cluster 3.0 was used to cluster the data. Data were visualized using Java Treeview. (C) Clustering of genes that are 
regulated by E2 treatment and genes that have binding sites from different clusters: BED files for E2-regulated genes were generated using Genespring. The seqMINER 
enrichment based method was used to calculate the density array for the E2-regulated gene BED files and BED files obtained for each cluster from Figure 2A. Density 
array was clustered using Cluster 3.0 and data was visualized using Java Treeview. (D) The pS2 gene and associated binding sites for ERa, ERp, RIP140, and SRC3 in 
the three cell backgrounds. (E) The pS2 gene regulation by E2 (10"^^ to 10"^M) in the different cell backgrounds. (F) The pS2 gene ChIP for ERa, ERp, and 
coregulators in cells treated with control vehicle (0.01% ethanol) or 10 nM E2. (G) The PgR^ene and associated binding sites in cells treated with control vehicle (0.01% 
ethanol) or 10 nM E2. (H) The PgR^ene regulation by E2 in the different cell backgrounds. (I) The PgR^ene ChIP for ERa, ERp, and coregulators in cells treated with 
control vehicle (0.01% ethanol) or 10 nM E2. (J) The 0TUB2Qene and associated binding sites in cells treated with control vehicle (0.01% ethanol) or 10 nM E2. (K) The 
0TUB2^ene regulation by E2 in the different cell backgrounds. (L) The 0TUB2^ene ChIP for ERa, ERp, and coregulators in cells treated with control vehicle (0.01% 
ethanol) or lOnM E2. 



Cell receptor background-specific effects on 
profiles of gene regulation and ERot-, ERp-, 
SRC3-, and RIP140-binding sites 

We investigated the cell background-specific effects on 
selected E2-regulated genes and ER and coregulator binding 
sites. For each example provided, we show a Genome Browser 
view of the genomic region of interest with the highlighted 
ER-binding site patterns in each cell background, a dose- 
response curve of mRNA expression, and ChIP assays for ERa, 
ERP, SRC3 and RIP140 in the three cell backgrounds. For 
example, the well-known estrogen-regulated gene TFFl/pS2 
has binding sites from Clusters 1,2, and 4 (Figure 4D); its mRNA 
(Figure 4E} is upregulated by its natural hgand, E2, in all cell 
backgrounds, albeit to a different magnitude, with the highest 
stimulation being in the ERa/ERp cells. Also, the E2 dose- 
response analysis reveals a change in sensitivity to E2, evident 
in the ERa/ERp cells, where the EC50 for E2 is close to 10 " M, 
compared with 10 ~ to 10 ~ ^° M in ERa or ERp cells. By ChIP 
assays (Figure 4F}, it is also evident that there is an increase in 
ERa recruitment when ERp is present, and a slight increase in 
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ERp recruitment when present with ERa, and this is mirrored 
also in elevated SRC3 and RIP140 recruitment in ERa/ERp 
cells. For the progesterone receptor {PgR) gene, which harbors 
binding sites from Clusters 1,2, and 7 (Figure 4G), there were 
no major differences in magnitude of response or recruitment 
of ERs in the three cell backgrounds (Figure 4H and 1); 
however, a change in sensitivity to E2 was observed but 
opposite in direction to that seen for TFFl (EC50 10~^°M for 
ERa cells, 10"^ M for ERa/ERp or ERp cells). Finally, for the 
otubain2 {0TUB2) gene, which we previously identified 
as an ERp target gene (Chang et aU 2006, 2008], as seen in 
Figure 4J-L, we confirmed little regulation of 0TUB2 mRNA in 
ERa cells, and little ERa recruitment and no stimulation of 
SRC3 or RIP140 recruitment in these cells; by contrast, 
maximal stimulation of the 0TUB2 gene expression was seen 
in ERp and ERa/ERp cells (Figure 4K and Supplementary 
Figure 3). Interestingly, ChIP assays showed that ERa could be 
recruited to the gene regulatory site when ERP was also present 
(ERa/ERp cells), and this correlated with increased recruit- 
ment of SRC3 and RIP140 in this cell background (Figure 4L). 
ERp showed increased recruitment in response to E2 in 
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pathway 



Figure 5 Pathway analysis of the E2-regulated genes associated with the ERa and ERp-binding site clusters. (A) Clustering of gene ontologies that are differentially 
enriched in ERa cells versus ERp cells: ClueGO plugin of Cytoscape was used to identify differentially regulated GO terms between ERa and ERp cells. (B) Expanded 
view of M-phase-associated clusters. (C) Expanded view of additional M-phase-associated clusters. 



ERa/ERp and ERP cells that was accompanied by increased 
recruitment of both coregulators in these two cell backgrounds 
(Figure 4L). At the Ki-67 gene, RIP140 recruitment 
was strongly associated with E2 stimulation in ERp-only 
cells, consistent with our ChlP-Seq data (Supplementary 
Figure 3). 



Pathway analysis of the E2-regulated genes and 
effects on hormone-regulated proliferation and their 
association with ERoc- and ER|3-binding site clusters 

The GO analysis of the binding-site clusters was performed 
using web-based Metacore, DAVID, Cytoscape ClueGO plugin 
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and Genespring software programs (Dennis et al, 2003; 
Huang da et aU 2009), and the main results are shown 
in Figure 5 and Table 1. Of note, the major group of genes 
found to be differentially regulated between ERa and ER(3 cells 
was associated with regulation of the M-phase of the cell cycle 
(Figure 5A-C} (a complete description of GO terms associated 
with all of the binding-site clusters is given in Table I). 

The GO terms 'cell cycle related' were associated with 
E2-stimulated genes selectively in ERa cells, and this was 
abrogated in ERa/ER(3 cells. Interestingly, genes involved in the 
Gl-S transition of the cell cycle were upregulated by E2 in both 
ERa-only and ERp-only cells, but not in ERa/ER(3 cells, and genes 
involved in the G2-M transition were upregulated only in ERa- 
containing cells but not in the other two cell backgrounds, 
highlighting profound differences in regulation of cell cycle 
stages by each receptor, either alone or in combination (Figure 5, 
Table I, and Supplementary Figure 4} . We therefore examined in 
more detail the impact of ERa and ER(3 on cell proliferation in the 
three cell backgrounds, with a specific interest in understanding 
the role of ER(3 when present alone or in combination with ERa. 

As seen in Figure 6A, ERa cells showed a stimulation of 
prohferation in response to E2. The copresence of ERp greatly 
reduced prohferation, and cells expressing only ER(3 showed 
little prohferative stimulation by E2. Because of the interesting 
differences in regulation of cell cycle progression by each ER 
that were highhghted by the combination of gene expression 
and cistrome analyses, we went on to vahdate this also by flow 
cytometry (Figure 6B). As predicted, we observed an 
E2-dependent increase in S phase in both ERa and ER(3 cells, 
but not in ERa/ERp cells. Notably, even though there was a 
basal increase in G2-M phase in ER(3-containing cells, hgand- 



dependent changes in the proportion of G2-M phase cells were 
observed exclusively in ERa cells, and not in the other two cell 
types, indicating that the entire cell cycle machinery is 
activated only when ERa is present alone. The changes in cell 
prohferation and phases of the cell cycle were dependent 
on the RIP140 coregulator function, as knockdown of 
RIP140 abrogated the E2-mediated increase in cell number 
(Supplementary Figure 4A} and percent of cells in G2-M phase in 
all cell backgrounds (Supplementary Figure 4B} . Interestingly, in 
ER(3-containing cells RIP140 knockdown also increased the 
percent of S-phase cells, consistent with a report suggesting a 
corepressor function for RIP140 action with E2F1 (Docquier et al, 
2010} . There was also an increase in the subGl population when 
we introduced ERp, and this was further enhanced with RIP140 
knockdown, implying a corepressor function for RIP 140. However, 
RIP 140 overall acted as a coactivator for E2 -stimulated cell 
proliferation mediated by ERa. 

Our gene expression microarray data suggest that the effect 
of ERp might be due to reduced expression of important 
G2-M phase activators (Figure 6C and Supplementary 
Figure 5), such as FOXMl, as shown in ERa/ERp cells 
previously (Chang et al, 2006), and Ki-67 and BIRC5, which 
all showed a RIP140-dependent increase in expression level 
only in ERa cells (Supplementary Figure 4C) . 

Analysis of gene networks and pathways 
associated with the influence of ER|3: impact on 
cell proliferation, MAPK activation, adipogenesis, 
and the involvement of RIP140 

Our examination of the role of ERP in cell prohferation 
revealed that ERp affects cell prohferation in at least three 
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Figure 6 ERa supports strong E2-mediated cell proliferation; ERp reduces the proliferative stimulation of ERa in ERa/ERp cells, and ERp by itself supports very little 
if any hormone-enhanced proliferation. (A) Cell proliferation: ERa, ERa/ERp, and ERp cells in six-well plates were treated with Veh (0.01% EtOH) or 10 nM E2 (day 0). 
Treatment was repeated at day 2. Cell number was monitored by MTS assay. Treatment groups were monitored in triplicate in two separate experiments. (B) FACS 
analysis: cell cycle stages were analyzed by flow cytometry using BD-FACS Canto. Cells were fixed in 70% ethanol, stained for 30 min with 20 |ig/ml PI in Triton-X in the 
presence of DNAse-free RNAse A, and PI staining was measured. (C) Gene expression of four M-phase-related genes in cells treated with control vehicle or 10 nM E2 
for 24 h. mRNA level in vehicle-treated ERa cells was set at 1. 
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Figure 7 Impact of ERp on ERK1/2 activation, RIP140 gene expression, and adipogenesis, and an ERp and RIP140 gene signature and its association with overall 
survival and recurrence status of breast cancer patients. (A) Modulation of MARK pathway activation by ERp. MCF-7ERa cells were infected with the indicated amounts 
of ERp-adenovirus (multiplicity of infection, MOI) and were treated with 1 0 nM E2 for 1 5 min. ERK1/2 phosphorylation was monitored using pMAPK antibody (91 01 ; Cell 
Signaling). Total ERK2 (as loading control), and ERa and ERp were also monitored. C represents control cells infected with 40 MOI of AdGal and no AdERp. 
(B) Analysis of RIP1 40 mRNA levels in ERa, ERa/ERp, and ERp cells treated with control vehicle or 1 0 nM E2 for 24 h. (C) Regulation of genes involved in adipogenesis 
in the three cell types treated with vehicle or E2 for 24 h. (D) Total triglyceride levels in cells were measured. Assays were performed twice in triplicate. (E) Patient survival 
analysis using our 20-gene ERp and RIP140 signature and van de Vijver efa/(2002), Kao efa/(201 1), Finak efa/(2008) and Esserman efa/(2012) breast cancer data 
sets. All the data sets were obtained from the Oncomine database. (F) Hierarchical clustering of our signature genes using expression values obtained from the van de 
Vijver et al (2002), data set. (G) Comparison of average expression levels of the signature genes based on survival and recurrence data of patients. 
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important ways. As demonstrated in Figure 6, ER(3 changed 
the cell cycle profile and was unable to drive the cells into 
G2/M. This first mechanism could be due to the loss of 
central factors, such as FOXMl (Supplementary Figure 4C; 
Chang et aU 2006} , and a direct effect on cell cycle genes via the 
loss of binding of ERa when ERp is present, as shown by 
Cluster 1, and gain of genomic binding of ERp, as shown by 
Cluster 8, which is enriched for cell cycle and apoptosis-related 
genes. 

Second, we observed that ERa binds to sites near important 
pathway modulator genes (such as MAP2K1-MEK1 and 
MAP3K1-MEKK1]. In our previous studies (Madak-Erdogan 
et al, 2011} we showed that MAPK signaling was essential 
for E2-mediated cell prohferation. To test the impact of ERP on 
MAPK signaling, we monitored ERKl/2 activation after E2 
treatment of cells containing both receptors. Our findings 
(Figure 7 A) show that E2 increased pERKl/2 in cells contain- 
ing ERa only (C, control cells with ERa but no ER^} and that 
increasing concentration of ERP abrogated pERKl/2 
activation. 

A third mechanism we uncovered was the major influence of 
ERp on metabohc pathways (Figure 7 B-D} due to its 
preferential coupling with RIP140. We observed in our 
cistromes analysis that the number of genomic binding sites 
for RIP140, a well-known factor in metabohc regulation 
(Rosell et aZ, 2011; Siersbaek et al, 2012}, was dramatically 
increased in E2-treated cells containing ERp only. GO analysis 
of these binding sites (Table I, and see Supplementary Table 3 
for functional enrichment of pathway and process networks 
for ERp-modulated genes} revealed enrichment for genes 
associated with fatty acid metabohsm, regulation of NF-kB and 
inflammation, IGF-1 signaling, and mammary gland epithelial 
development. 

RIP140 is an estrogen-induced protein (Cavailles etaU 1995; 
Augereau et aU 2006} and is known to be a repressor of 
metabohc pathways (Christian et aU 2005; Rosell et aU 2011}, 
especially lipolysis in adipocytes. RIP140 was 3-fold upregu- 
lated by E2 in ERa and ERa/ERp cells but, most remarkably, 
RIP140 was more than 100-fold upregulated by hormone in 
ERP cells (Figure 7B} . Of note, we observed greatly elevated 
expression of key adipogenic factors, such as DDIT3, FOXOIA, 
and MEF2A, in ERp cells treated with E2, consistent with the 
binding data for ERs and coregulators (Figure 7C and 
Supplementary Figure 6} . This gene regulation was dependent 
on RIP140 and SRC3, as knockdown of either coregulator 
blocked the mRNA induction (Supplementary Figure 7}. 
Interestingly, recruitment of each coregulator was reduced 
when the other coregulator was depleted, suggesting a 
cooperative role for each coregulator in the recruitment of a 
functional complex. When we monitored mRNA expression 
and coregulator recruitment for other genes not associated 
with adipogenesis, such as pS2 or 0TUB2, we saw that SRC3 
was required for mRNA induction and RIP 140 recruitment in 
all cell backgrounds, while RIP140 showed a gene and cell 
background-specific function. For TFFl, RIP140 knockdown 
increased mRNA induction and interestingly increased recruit- 
ment of SRC3 in all receptor cell backgrounds. For 0TUB2, 
RIP140 knockdown in ERa/ERp cells increased mRNA 
induction and SRC3 recruitment, whereas both of these were 
blocked by the knockdown of RIP140 in ERp cells. These 
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results suggest that although RIP 140 acts as a coactivator or 
corepressor in a gene and cell background-dependent manner, 
adipogenesis requires high levels of RIP 140 that we observe in 
ERP cells. 

To further assess the effect of ERp and RIP140 on adipogenic 
properties of the breast cancer cells, we monitored the cellular 
level of triglycerides (Figure 7D}, which revealed a threefold 
increase in triglyceride levels in ERP cells. This effect was 
dependent on RIP140. 

These findings in MCF-7 cells were also verified in another 
ERa-positive cell line, T47D, where we introduced ERP and 
generated cells with ERa and/or ERp. Findings from cell 
prohferation (Supplementary Figure 8A}, adipogenesis 
(Supplementary Figure 8B}, and adipogenesis-related gene 
expression (Supplementary Figure 8C} studies in these T47D 
cells were consistent with our observations in MCF-7 cells. 
The results rule out phenotypic differences and transcriptional 
regulatory effects being unique to the MCF-7 cells. 
The observations in these MCF-7 and T47D breast cancer 
models also highhght several novel gene networks involved in 
delineating the phenotypic properties of the ERp cells and the 
suppression of prohferation by this receptor. 



A gene signature associated with RIP140 predicts 
better prognosis in breast cancer patients 

To examine possible importance of our gene targets in breast 
cancer patient prognosis, we used the Oncomine database to 
obtain a 20-gene signature associated with ERp- and RIP140- 
binding sites, and found that this signature predicted better 
prognosis for patients in several large clinical data sets. The 
initial gene list that we used to derive the signature included 
genes whose expression was modulated by ERp in our gene 
expression microarrays and harbored binding sites for ERP and 
RIP140 within 20 kb of the transcription start site of the gene. 
The gene signature predicted clinical outcome in 23 data sets 
from Oncomine (Figure 7E and Supplementary Figure 9}. 
Moreover, this signature was most predictive for outcome in 
breast cancer as compared with other cancers, as we observed 
an enrichment of concepts associated with breast cancer but 
not other cancer types (Supplementary Figure 9}. The 
signature genes included RIP 140; oxidoreductases ALDH3A2, 
GLUDU PRDX3, and GPX4; ribosomal proteins RPL22, RPLPl, 
and RPS15; negative regulators of gene transcription BNIP3L, 
DICERl, and TCEALl; cell cycle regulators RAD17, CREBL2, 
and CGRRFl; and ubiquitin modifiers TTC3 and UBE3A. These 
genes were coexpressed in breast tumors (Figure 7F}, and their 
overexpression marked tumors that had a better prognosis in 
terms of overah survival and reduced recurrence (Figure 7G 
and Supplementary Figure 10} . Thus, our integrative genomics 
approach revealed genes that are associated with better 
prognosis in breast cancer patients. 

Discussion 

The regulation of transcription in eukaryotes involves 
the assembly of multiprotein complexes at specific sites on 
chromosomal DNA that occurs in a temporally coordinated 
and cell-specific manner in response to specific stimuh 
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(Katzenellenbogen et al, 2000; Katzenellenbogen and 
Katzenellenbogen, 2002; Welboren et al, 2009; Madak- 
Erdogan et aU 2011; Ross-Innes et aU 2012). Herein we have 
shown that multiple rounds of data analysis using clustering 
algorithms have enabled us to delineate a hierarchy of ER- and 
coregulator-binding site subgroups that demarcate and govern 
gene expression programs and lead to the distinct physiolo- 
gical outcomes controlled by the closely related TPs, ERa and 
ERp. The two ERs have very specific cistromes and differen- 
tially engage SRC3 and RIP 140 at different chromatin sites to 
regulate distinct sets of target genes that can explain the ERa 
prohferative and ERp antiprohferative activities of these 
receptors. In fact, we find that ERa is capable of regulating 
genes involved with all phases of the cell cycle when expressed 
alone, whereas ERp when coexpressed with ERa dampens this 
response. When ERp is expressed alone, it appears capable 
of driving the cells into S phase, but is then incapable of 
successfully completing the cell cycle. Moreover, this action of 
ERP on cell cycle genes is complemented by ERP cell-selective 
effects on lipid metabohsm genes that can be linked to an ERp 
preferential usage of the coregulator RIP140. 

Given the clinical importance of ERa and ERP in breast 
cancer development and in the response of patients to breast 
cancer treatments (Ah and Coombes, 2000; Katzenellenbogen 
and Katzenellenbogen, 2000; Speirs etal 2004; Saji etal 2005; 
Deroo and Korach, 2006; Skliris et al 2006; Chang et al 2008; 
Nilsson and Gustafsson, 2011), these findings illuminate the 
molecular mechanisms that underhe the distinct differences in 
the biological phenotypes and in the endocrine therapy 
responses of ERa- and/or ERp-positive breast cancers. Our 
analyses of gene expression data on human breast tumors 
where we could divide tumors into those expressing ERa or 
ERP only, or both receptors, showed distinct gene expression 
profiles for these tumors that were consistent with our 
observations in MCP-7 cells having these three different 
complements of ERs. In this human tumor gene expression 
microarray study, major differences were observed in cell 
prohferation genes that were correlated with clinical outcome 
and patient survival (Gruvberger-Saal et al 2007) . Patients in 
this cohort, whose tumors were ERa-negative and ERp- 
positive, had a better outcome on tamoxifen, consistent with 
a reduced prohferation status. Moreover, when we combined 
our gene expression data and binding site analysis to obtain an 
ERp-specific gene signature, this included RIP140 as well as 
other genes whose overexpression predicted better prognosis 
for breast cancer patients. Although speculative, utilization of 
ERp-selective agonists (Meyers etal 2001; Harris, 2006; Charn 
et al 2010) and activation of ERP gene programs in ERa plus 
ERp-positive or ERp-only-containing breast cancers, or 
increasing levels of RIP140, which could have a growth 
suppressive activity through ERp, might provide a benefit for 
patients with ERp-positive breast tumors. 

Although there have been reports on the binding of ERa and 
SRC3 in cells containing only ERa (Lanz et al 2010), and we 
and others have shown effects of introduction of ERP on the 
expression of some ERa-regulated genes (Paruthiyil etal 2004; 
Chang et al 2006; Lin et al 2007a; Chang et al 2008; Charn 
et al 2010; Grober et al 2011 ; Nassa et al 2011) , this is the first 
study to comprehensively compare the binding of ERs and 
coregulators, and the effects of hormone on gene expression in 
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breast cancer cells containing ERa and ERp alone and together. 
To the best of our knowledge, this is also the first report of 
genome-wide ChlP-seq binding of RIP140 in any MCF-7 ER 
context, and the first for the SRC3 cistrome in ERa and ERp, 
and ERp-only cells. We selected these two central ER 
coregulators for the study, as SRC3, also known as amphfied 
in breast cancer- 1 (Anzick et al, 1997), is a coactivator that 
promotes estrogen-stimulated transcriptional activity by ERa 
(Anzick et al 1997; Suen et al 1998; Azorsa et al 2001; Louie 
et al, 2004); RIP140 is an especially interesting coregulator, as 
it can act either as a coactivator or a corepressor (Cavailles 
etal 1995;Rosell etal 2011). 

ER-positive breast cancers differ in their content of ERa and 
ERp, and as the cancer progresses to increasingly malignant 
character there is usually a decline in the level of ERp relative 
to ERa (Speirs et al 2004; Saji et al 2005). To examine how 
estrogen action in breast cancer proceeds through the two ERs, 
in terms of chromatin-binding site selection and gene 
regulation, to result in distinct cellular phenotypes, we 
constructed a model that was based on MCF-7 cells. We chose 
this as the cell background because it is the most well-studied 
breast cancer cell line in which much is known about ERa- 
binding sites as well as cistromes for certain other cofactors, 
such as FOXAl and GATA3, providing a broad context for our 
studies (Carroll etal 2005, 2006; Eeckhoute etal 2007; Lupien 
et al 2008; Hurtado et al 2011 ; Liu et al 2011 ; Ross-Innes et al 
2012) . In addition, the introduction of ERp results in a decrease 
in prohferation and a change in other cellular properties that 
mirror the effects of ERP in human breast cancers, based on 
gene microarray and patient outcome data (Gruvberger-Saal 
et al 2007). 



A multilayer model of progressive specification 
that underlies the distinctive activity of estradiol 
(E2) through ERoc versus ER|3 in breast cancer cells 

In this study, as schematized in the model in Figure 8, we have 
examined at multiple levels the ways by which estradiol (E2) 
acts through its closely related receptors in breast cancer cells 
to generate distinct phenotypic and prohferative outcomes. At 
the chromatin-binding level, we show that ERa and ERp select 
distinct, only partially overlapping sets of chromatin-binding 
sites when present alone; when present together, they compete 
for these sites, and restrict and shift each other's binding. 
There are also characteristic differences in the ERa and ERP 
sites in terms of TF-binding motif enrichments and genomic 
location. 

At a second level, the coregulators SRC3 and RIP140, known 
to have important roles in the mammary gland and in breast 
cancer (Cavailles et al, 1995; Font de Mora and Brown, 2000; 
Xu et al, 2000; Augereau et al, 2006), become recruited in a 
distinctive manner to some sites at which ERa and/or ERp 
were bound to chromatin, and some hormone-induced genes 
were strongly associated with ER-binding sites that were also 
co-occupied by these coregulators. Of note was the increased 
(ca. fourfold) number of RIP140-binding sites in the ERp cells, 
and also the greatly increased expression of RIP140 in ERP 
cells, highlighting a potentially preferential action of this 
coregulator with ERp. Both coregulators were highly recruited 
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Figure 8 Model depicting modulation of gene expression, metabolism, and proliferative properties of breast cancer cells by ERa and ERp, based on findings in this 
report. The three cell contexts are illustrated, ERa at top, ERa/ERp in middle (as both homo and heterodimers), and ERp at bottom. Sites are shown with both SRC3 and 
RIP140 (right), and RIP140 and other factors, but not SRC3 (left). Representative associations between binding site clusters and cellular regulations are illustrated. 
These include antiproliferative actions of ERp through utilization of novel mechanisms, including modulation of transcription by ERa, changes in MARK signaling, and 
alteration in adipogenesis in breast cancer cells. 



to chromatin together with ER after E2 treatment, as shown by 
a > 80% overlap between the binding sites of the coregulators 
and the binding sites of the ERs. The majority of ER-binding 
sites, however, did not show any colocalization with SRC3 or 
RIP140; at these sites, ER might be using other coregulators 
(McKenna and O'Malley, 2002; Green and Carroll, 2007). 
Recent studies have elegantly demonstrated the promoter- 
specific involvement of distinct coregulators in the regulation 
of estrogen-responsive genes in MCF-7 cells. Thus, SRC3 was 
present at nine of the ten genes examined in MCF-7 ERa cells, 
whereas fewer coregulators were involved in estrogen 
stimulation of other genes, such as the PgR gene (Won Jeong 
et aU 2012). 

The overlay of our ERa and ERp cistromes with our SRC3 
and RIP140 coregulator cistromes, and with genome-wide 
cistrome analysis of the FOXAl and GATA3 pioneering factors 
by others (Carroll et al 2005, 2006; Eeckhoute et al 2007; 
Lupien et al 2008), followed by clustering, enabled a 
stratification of binding-site patterns that predict time and 
direction of gene expression and phenotypic changes in each 
cell background. FOXAl and GATA3 are important DNA-bound 
cofactors of ERa (Carroll et al 2005, 2006; Eeckhoute et al 
2007; Lupien et al 2008), and they have been shown to be 
necessary for ERa chromatin binding and subsequent activa- 
tion of many ERa target genes. This additional layer of analysis 
allowed us to organize the ER-binding sites into 11 clusters 
having distinct characteristics in terms of predicted TF-binding 
site motifs, genomic location, conservation, associated genes 
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and enriched pathways. Finally, the direction of E2 regulation 
(stimulation or repression) of the target genes could be defined 
based upon clustering analysis of ER and coregulator 
cistromes. Of note, each cluster of binding sites could be 
associated with distinctive functional gene ontologies that can 
explain avenues through which ERa and ERp modulate cell 
prohferation and metabohsm. 



Specification of estrogen action at the 
chromatin layer 

Despite sharing great sequence and structural homology 
in the DNA-binding domain (Katzenellenbogen and 
Katzenellenbogen, 2000; Nilsson et al 2001; Deroo and 
Korach, 2006), ERa and ERp select distinct, only partially 
overlapping chromatin-binding sites, with more sites being 
accessed by each ER when present by itself. When both are 
together, mutual competition between them restricts the 
number of sites that are occupied by each ER subtype and 
contracts the number of overlapping sites; moreover, we also 
observe the appearance of many new binding sites for ERa and 
ERP when both are present, which indicates a profound 
redistribution of cistromes when the receptors are copresent. 

Although the two ERs can both homodimerize and hetero- 
dimerize, and both homo and heterodimers can be observed 
in vitro and in cells (Powell and Xu, 2008; Paulmurugan et al 
2011; Powell et al 2012), the potential number of chromatin- 
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binding sites through which ERa-ER(3 heterodimers might 
function (12 000 shared sites) is only a fraction (43%) of the 
total number of ER-binding sites that can be occupied by either 
ER in the ERa/ERp cells (28 000} . In addition, most of the new 
sites present only in ERa/ERp cells are associated uniquely 
with either ERa or ER(3, and thus can be occupied only by 
homodimers of ERa and ERp. Clearly then, the context of a 
chromatin-binding site imposes a significant specification of 
whether ER homo or heterodimers bind, regardless of their 
cellular abundance. 



Specification of ER action at the gene 
function-phenotypic layer 

Among the nuclear receptor family, it has proved challenging 
to associate ER chromatin-binding sites with estrogen- 
regulated genes, because many of the ER regulatory sites are 
in far-distal enhancer regions rather than in proximal promoters 
(Carroll et al 2005, 2006; Lin et aU 2007b; Madak-Erdogan 
et aU 2011). Nevertheless, through our genome- wide location 
and expression analyses, we were able to define several novel 
mechanisms underlying the effects of ERP on cell prohferation. 
By analyzing the effect of E2 on the cell cycle, we found that E2 
directly regulated genes involved in all the phases of the cell 
cycle in ERa cells; this was dramatically altered and reduced, 
however, with ERp copresence. Very interestingly, ERp, when 
present alone, was able to increase the percent of cells in S 
phase, but cells were then unable to complete the cell cycle 
because they became arrested at the G2/M transition. 

GO analysis of ER-binding site clusters revealed an 
intriguing model for ERP antiproHferative action. First, ERP 
impinges on cell cycle genes in multiple ways: by reducing ERa 
binding when ERP is present (cluster 1), by binding directly to 
promoters of cell cycle-regulated genes (cluster 8), and by 
reducing the levels of factors imphcated in G2/M transition 
(e.g., FOXMl). Cluster 8 was particularly interesting, as it was 
enriched in promoter proximal binding sites that contained 
overrepresented motifs for E2F TFs. The E2F family members 
consist of activators and repressors, and the fact that our 
findings show that ERp-binding sites in this specific cluster are 
more enriched with E2F motifs (Table I) raises the possibility 
that ERP might be preferentially binding to the chromatin near 
E2F-binding sites and utilizing some of the E2F members to 
affect the regulation of its target genes, consistent with their 
suggested involvement from prior studies (Strom et aU 2004; 
Stender et al 2007). 

ERp also exerted an extensive effect on MAPK signaling, 
which we previously showed to be essential for stimulation of 
cell proliferation by E2 in ERa-containing breast cancer cells 
(Madak-Erdogan et aZ, 2011). Moreover, ERp altered cell 
metabohsm that is central to cell growth, where it was found 
to act in several reinforcing ways. ERP increased the genomic 
binding and expression level of RIP140, a critical metabohc 
regulator, in ERp-only cells. These unique binding sites were 
related to genes involved in IGF-1 signaling, inflammation, 
and fatty acid metabohsm. Functional outcomes associated 
with increased binding of RIP140 included the ability of ERP to 
directly bind and increase the expression of adipogenesis 
genes, elevating triglyceride levels in ERp cells. 
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An integrative, whole-genome and clustering 
analysis for elucidating the progressive 
specification of the biological activities of NHR 
complexes 

Through integration of cistromes and transcriptomes, and by 
apphcation of clustering analyses, we have been able to define 
a combinatorial pattern of regulation by these factors, in 
response to hormone stimulation, that specifies gene regula- 
tion across binding site clusters. Importantly, gene expression 
thereby becomes predictable and allows association of multi- 
ple regulatory sequences with a pattern of control defined by 
the cell's receptor context and the time of hormonal exposure. 

Work in many laboratories, in both mammahan systems 
(Farnham, 2009} and in yeast (Beer and Tavazoie, 2004), has 
highlighted the difficulty in associating TF cistrome binding 
with gene expression outcomes. Our approach using clustering 
analyses, which we have used to elucidate patterns of binding 
of multiple regulatory factors and their association with gene 
regulation by two of the most important NHRs in human breast 
cancer, should be of help in characterizing the layers of 
specification that determine the regulation of transcription and 
distinct functional outcomes by any TF of interest. 

Materials and methods 

Cell culture, adenovirus, lentivirus, and siRNA, 
and ligand treatments 

MCF-7 cells were grown in minimal essential medium (MEM) (Sigma, 
St Louis, MO) , supplemented with 5 % calf serum (HyClone, Logan, 
UT), and 100|ig/ml penicillin/streptomycin (Invitrogen, Carlsbad, 
CA) (Chang et al, 2006, 2008). For experiments, the cells were 
maintained in phenol red-free MEM plus 5 % charcoal-dextran-treated 
calf serum for at least 3 days, and were then seeded at a density of 
3 X 10^ cells per 10-cm tissue culture dish (Corning, Corning, NY) for 
2 days before adenovirus infection. Recombinant adenoviruses were 
constructed and prepared as described (Chang et al, 2006). Adenoviral 
infection conditions were those described previously (Chang et al, 
2006; Frasor et al, 2006; Cham et al, 2010) and were used to generate 
MCF-7 cells expressing levels of ERp equal to that of the endogenously 
expressed ERa. ERp-only cells were generated from these cells by 
knockdown of ERa in parental cells using the following siERa 
sequences from Dharmacon: forward, 5'-UCAUCGCAUUCCUUG 
CAAAdTdT-3', and reverse, 5'-UUUGCAAGGAAUGCGAUGAdTdT-3'. 
siRNA experiments were performed as previously described, and 
resulted in knocknown of ERa mRNA and protein by greater than 95 % 
(Chang et al, 2008) . Briefly, cells were transfected with 20 nM siCtrl or 
siERa for 48 h after infection. Then, cells were treated with 0.1% 
EtOH (Veh) or 10 nM E2 (Sigma- Aldrich) for the indicated times. 
For coregulator knockdown experiments, 24 h after control or ERp 
adenovirus infection, cells were infected with control, SRC3 or RIP 140 
shRNA lentiviral particles according to the manufacturer's suggestions 
(Santa Cruz Biotechnology). All experiments were conducted with 
three or more replicates. 



Chip assays 

Chip assay was carried out as described (Barnett et al, 2008) for ERa, 
ERp, SRC3, and RIP140. Briefly, the different cells were treated with 
0.1% EtOH (Veh) or 10 nM E2 for 45 min. Chromatin was crosslinked 
using 1% formaldehyde for 15 min at room temperature. Cells were 
washed with PBS and were collected. After three times 10 s sonication 
in Chip lysis buffer, the samples were centrifuged for 10 min at 4 °C. 
The antibodies used for immunoprecipitation of DNA/protein com- 
plexes were ERa antibody HC-20 (Santa Cruz Biotechnology); ERp 
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antibodies were a combination with equal parts of CWK-F12 
(produced in our lab (Choi et al, 2001)), GTX70182 (GeneTex), GR40 
(Calbiochem), and PAl-311 (Affinity Bioreagents) ; SRC3 antibody SC- 
9119 (Santa Cruz Biotechnology); and R1P140 antibody SC-8997 (Santa 
Cruz Biotechnology) . Precipitated complexes were washed with RIPA 
buffer (three times) and TE buffer (2 times) . After overnight incubation 
at 68 °C, Chip DNA was isolated using QIAGEN PGR purification kit as 
per the manufacturer's suggestions. The ChIP DNA was used for 
ChlP-seq analysis and quantitative real-time PGR. 



ChlP-seq analysis and clustering 

The Chip DNA was prepared into libraries according to Illumina Solexa 
ChlP-seq sample processing methods, and single read sequencing was 
performed using the Illumina Solexa Genomic Analyzer. Sequences 
generated were mapped uniquely onto the human genome (hgl8) by 
ELAND. MACS algorithm (Zhang et al, 2008) was used to identify 
enriched peak regions (default settings) with a P-value cutoff of 
6.0e - 7 and FDR of 0.01 . ChlP-seq data will be available from the Gene 
Expression Omnibus database. 

The seqMINER density array method with a 300-bp window in both 
directions was used for the generation of clusters, i.e., groups of loci 
having similar compositional features (Ye et al, 2011). This ChlP-seq 
data interpretation platform allows the comparison and integration of 
multiple ChlP-seq data sets, and their extraction and visualization 
of specific patterns within data sets. A meta binding BED file 
was generated using ChlP-seq tool set, which included ER-binding 
sites from all three cell backgrounds (Blahnik et al, 2010). As there 
were only a relatively small number of coregulator-specific binding 
sites without any receptor binding, these sites were not included in the 
clustering step. This reference file was then used to map the binding 
sites for each factor from different cell backgrounds. BED files for each 
cluster were used for further analysis with Galaxy Cistrome integrative 
analysis tools (Venn diagram, conservation, CEAS; Liu et al, 2011). 



Gene Chip mRNA transcriptional profiling microarrays 

Total RNA was used to generate cRNA, which was labeled with biotin 
according to techniques recommended by Affymetrix. All analyses 
were done for three or more samples for each treatment. The biotin- 
labeled cRNA was then hybridized to Affymetrix U133 plus 2.0 
GeneChips, which contain oligonucleotide probe sets for over 47000 
transcripts. After washing, the chips were scanned and analyzed using 
Affymetrix processing software. GEL files were processed using 
GeneSpring GX 11.0 software (Agilent) to obtain fold-change and 
P-value with Benjamini and Hochberg multiple test correction 
(Hochberg and Benjamini, 1990) for each gene for each treatment 
relative to the vehicle control. We considered genes with fold-change 
>1.8 and P-value <0.05 as statistically significant, differentially 
expressed. 



Motif and GO category analysis 

Overrepresented GO biological processes were determined by the 
web-based DAVID Bioinformatics Resources database (Dennis et al, 
2003; Huang da etal, 2009) and the GeneSpring and web-based GREAT 
(Genomic Regions Enrichment of Annotations Tool) software (McLean 
et al, 2010). Motif-enrichment analysis was done using Seqpos, which 
uses TRANSFAG PWMs, JASPAR, and de novo motif discovery tools 
(Liu et al, 2011). 



Cell proliferation and flow cytometry 

MCF-7 cells were seeded at 30 000 cells/well in six-well plates. On the 
second day, the cells were infected with AdGal or AdERp at multiplicity 
of infection 20 for 4h and then the medium was changed. After 24 h, 
the cells were transfected with 20 nM siCtrl or siERa. At 24 h, cells were 
treated with Veh (0.01 % EtOH) or 10 nM E2 (day 0) and again at day 2. 
Cell number was monitored using the MTS assay (Bergamaschi et al, 
2011). Flow cytometry and analysis of cell cycle stage were conducted 
as described using BD-FACS Canto. Cells were fixed in 70% ethanol. 



stained for 30min with 20|ig/ml propidium iodide (PI; Molecular 
Probe) in Triton-X (Sigma) in the presence of DNAse-free RNAse 
A (Sigma), and PI staining was measured (Bergamaschi et al, 2011; 
Bhatt et al, 2012). 



Adipogenesis assay 

Adipogenesis was monitored in cells grown in full media and used the 
adipogenesis assay kit from Bio Vision (Milpitas, CA) that measures 
total triglycerides. Absorbance was measured at 570nm using a 
BioRad 680 Microplate Reader, and all assays were performed in 
triplicate. 



Tumor data sets and data analysis 

Gene lists for generation of the ER(3 and RIP140 signature were selected 
based on the modulation of gene expression in ERp-containing cells, 
and the presence of ERp- and RIP140-binding sites within 20 kb of the 
transcription start site of the genes. This gave us a list of 345 genes. We 
further examined this list using the Oncomine database to find genes 
that predicted molecular outcome in breast cancer data sets with an 
odds ratio of at least 2 and P-value of 0.0001 . The log2 median-centered 
intensity expression values for signature genes were obtained from the 
Oncomine database. This analysis yielded a 20-gene ERp and RIP140 
signature. Hierarchical clustering of data was performed and displayed 
using Cluster 3.0 and Java TreeView software for analysis and 
visualization. 



Survival analysis 

For breast cancer patient survival analysis, patients were stratified 
according to average expression value of the genes in the signature, 
and the top 30% and bottom 30% of patients were used for 
computation of Kaplan-Meier curves by the Cox-Mantel log-rank test 
and Gehan-Breslow-Wilcoxon test in Graphpad Prism. 



Data availability 

Gene expression and ChlP-Seq data will be available from the Gene 
Expression Omnibus database with the following accession numbers: 
meta study, GSE42349; gene expression, GSE42347; and ChlP-Seq, 
GSE42348. 



Supplementary Information 

Supplementary information is available at the Molecular Systems 
Biology website (www.nature.com/msb). 
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